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ABSTRACT 

LIN28 function is fundamental to the activity and 
behavior of human embryonic stem cells (hESCs) 
and induced pluripotent stem cells. Its main roles 
in these cell types are the regulation of transla- 
tional efficiency and let-7 miRNA maturation. How- 
ever, LIN28-associated mRNA cargo shifting and re- 
sultant regulation of translational efficiency upon 
the initiation of differentiation remain unknown. An 
RNA-immunoprecipitation and microarray analysis 
protocol, eRIP, that has high specificity and sen- 
sitivity was developed to test endogenous Dis- 
associated mRNA cargo shifting. A combined eRIP 
and polysome analysis of early stage differentiation 
of hESCs with two distinct differentiation cues re- 
vealed close similarities between the dynamics of 
LIN28 association and translational modulation of 
genes involved in the Wnt signaling, cell cycle, RNA 
metabolism and proteasomal pathways. Our data 
demonstrate that change in translational efficiency 
is a major contributor to early stages of differentia- 
tion of hESCs, in which LIN28 plays a central role. 
This implies that eRIP analysis of LIN28-associated 
RNA cargoes may be used for rapid functional quality 
control of pluripotent stem cells under manufacture 
for therapeutic applications. 



INTRODUCTION 

LIN28 is an evolutionarily conserved RNA-binding pro- 
tein (RBP) and a key regulator of developmental timing 
(1). LIN28 knockout mice showed reduction of the germ 
cell pool, and were are unable to survive past birth (2,3). 
LIN28 is highly expressed in both undifferentiated mouse 
and human embryonic stem cells (mESCs and hESCs) as 
well as developing tissues, with its expression decreasing 
upon differentiation (4-6). Along with key transcription 
factors OCT4, SOX2 and NANOG, LIN28 has been used 
to reprogram adult human fibroblasts to induced pluripo- 
tent stem cells (7), and was shown to be important for the 
maturation of these reprogrammed cells (8). 

LIN28 is a predominantly cytoplasmic protein that asso- 
ciates with RNA in stress granules, P-bodies and polysomes 
(9). LIN28 also binds to the terminal loops of let-7 miRNA 
family precursors and inhibits their processing into mature 
miRNAs (10-14). This is important in the regulation of dif- 
ferentiation (15,16), especially as LIN28 and let-7 form a 
regulatory negative feedback loop (17). Interestingly, let-7- 
independent translation regulation by LIN28 occurs before 
the /e?-7-dependent step in Caenorhabditis elegans (18,19). 
LIN28 enhances translation, in a /e?-7-independent man- 
ner, of mRNAs important for cell growth in embryonic stem 
cells via the recruitment of RNA helicase A to polysomes 
(20-22). In both hESCs and somatic cells, LIN28 has also 
been shown to regulate splicing factor abundance (23). In 
mature cells, LIN28 binds to and enhances the translation 
of mRNAs for several metabolic enzymes, thereby increas- 
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ing glycolysis and oxidative phosphorylation, thus driving 
tissue repair by reprogramming cellular metabolism (24). 
Conversely, it was recently shown that LIN28 could also act 
to repress translation of endoplasmic reticulum (ER) asso- 
ciated transcripts (25), suggesting that post-transcriptional 
regulation by LIN28 could occur both positively and nega- 
tively, and thus more pervasive than previously thought. 

It is unclear whether /er-7-independent translational reg- 
ulation by LIN28 occurs in hESCs upon the initiation 
of differentiation. RNA cargo shifting may be influenced 
by existing active let-7 molecules and so /e?-7-independent 
regulation is defined here as the period in which mature 
let-7 miRNA levels remain constant. It is also unknown 
what proportion of mRNAs are translationally activated 
or suppressed upon increased or decreased association with 
LIN28 during early differentiation of hESCs, and whether 
various differentiation cues direct early cellular changes 
through common and/or distinct LIN28-associated regu- 
lated pathways. Another driving force for this work was 
to establish a robust framework and database to analyze 
rapidly the functional quality of pluripotent stem cells dur- 
ing industrial production, as this is an essential component 
of the manufacturing process of cells destined for therapeu- 
tic applications. 

To identify mRNAs associated with endogeneous 
LIN28 in hESCs, an enhanced non-cross-linking RNA- 
immunoprecipitation and microarray analysis technique 
(eRIP) was developed, as cross-linking-based protocols 
have been shown to introduce sequence biases and increase 
unspecific binding (26,27). Molecular crowding has been 
shown to stabilize folded RNA structure based on the 
principle of the Excluded Volume Effect (EVE) (28). In 
addition, we have demonstrated previously that the incor- 
poration of molecular crowders into enzymatic reactions, 
such as real-time PCR, increases sensitivity by up to 10-fold 
though a number of molecular effects, including stabilizing 
protein-nucleic acid interactions (29). The inclusion of 
molecular crowders during the immunoprecipitation step 
of eRIP improved specificity and reduced background 
signal. Underscoring the sensitivity of the method, eRIPs 
were performed with less than a million cells per sample, 
10- to 20-fold less than traditional RIP and equivalent 
cross-linking-based protocols (21,25). This methodology 
improvement also allowed multiple testing from the same 
small cell batch. 

Analysis of the dynamic changes of LIN28 association 
with its target mRN As upon the onset of differentiation of 
hESCs to trophoblast and neural lineages was conducted 
using eRIP, where the results showed consistently that the 
majority of these associations decrease upon short-term dif- 
ferentiation, prior to any change in mature let-7 miRNA 
levels. Utilizing polysome loading of mRNAs as a read- 
out for translational efficiency, we demonstrate that 95% of 
LIN28-associated transcripts decrease in translational effi- 
ciency within 24 h of trophectoderm-induced differentia- 
tion in hESCs. Of these, 750 increase, while 511 decrease, 
in LIN28 association. Crucially, the majority of these tran- 
scripts were common when a similar analysis was conducted 
with a neural differentiation protocol, including novel tar- 
gets such as L1TD1, PKM2, ID1, PIN1 and CSE1L. Gene 
ontology and pathway analysis reveal that LIN28 modu- 



lates cell cycle, RNA metabolism and Wnt signaling path- 
ways through a central core of proteasomal genes for the 
set of 750 mRNAs, and NGF, TNF-alpha/NF-K0 and IL- 
2 signaling pathways for the set of 5 1 1 transcripts. We thus 
provide evidence that LIN28 is indeed a central regulator 
of stem cell pluripotency and differentiation, independent 
of let-7 & miRNA regulation. 

MATERIALS AND METHODS 

Cell culture 

HI hESCs (Wicell) were cultured on Biomatrix in feeder- 
free culture conditions with mouse embryonic fibroblast- 
conditioned media (MEF-CM) (Tan et ah, submitted for 
publication). Briefly, human embryonic stem medium [80% 
Dulbecco's modified Eagle's medium-Ham's F-12 medium, 
20% Knockout serum replacement (Invitrogen), 1 mM 
L-glutamine, 0.1 mM (3-mercaptoethanol, 1% nonessen- 
tial amino acids and 4 ng/ml human basic fibroblast 
growth factor (bFGF)] was incubated with mitomycin C- 
inactivated MEFs overnight, with an additional 4 ng/ml 
of bFGF before hESC feeding. Cultures were passaged 
every 5-7 days, before they became confluent. Differen- 
tiation was induced via the addition of specific factors 
into MEF-CM, along with the removal of bFGF. For tro- 
phoblast induction, cultures were incubated in MEF-CM 
with 100 ng/ml of recombinant BMP4 (R&D Systems) and 
20 |xM of FGFR-inhibitor SU5402 (Calbiochem) for 1-5 
days (30). For retinoid differentiation, cultures were incu- 
bated in MEF-CM with 10 |xM of synthetic retinoid EC23 
(Reinnervate) for 1-5 days (31). 

Quantitative real-time PCR 

RNA was extracted from biological triplicate samples us- 
ing a combination of Trizol lysis (Invitrogen) and the 
RNeasy kit (Qiagen). 100-500 ng of RNA from each sam- 
ple was reverse transcribed using the high capacity cDNA 
reverse transcription Kit for mRNA analysis, or the Taq- 
Man miRNA reverse transcription kit for miRNA analysis 
(ABI). All PCR reactions were performed in technical du- 
plicates and biological replicates using validated TaqMan 
gene expression assays as per manufacturer's instructions 
on the 7900HT fast real-time PCR system (both ABI). Ex- 
pression data were normalized to GAPDH (for mRNAs) or 
RNU6B (for miRNAs) expression levels for each sample. 

siRNA transfection 

siRNA transfection was performed as described previously 
(32). Briefly, 1 x 10 5 hESCs in suspension were trans- 
fected using DharmaFECT2 (Dharmacon), with 50 |xM of 
either non-targeting siRNA control or ON-TARGETplus 
SMARTpool siLIN28 (both Dharmacon) as per manufac- 
turer's recommendations, and plated onto 12-well tissue cul- 
ture plates. Protein expression was analyzed 72 h after trans- 
fection. 

Western blots 

Protein extraction and western blots were performed as de- 
scribed previously (33). Briefly, cells were lysed in RIPA 
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buffer and cleared. Proteins were analyzed by SDS-PAGE 
(Invitrogen), transferred to nitrocellulose membranes and 
probed with the following antibodies: LIN28 (Protein 
Tech), PKM2 (Novus Biologicals), EZR, ACTB, KPNA2, 
PNP and DNMT3B (all Santa Cruz). 

Microarrays 

HT-12 v3.0 Beadchip microarrays (Illumina) were used for 
both eRIP and polysome experiments. cRNA libraries were 
prepared (Ambion) and samples hybridized to Beadchips 
(Illumina) using standard manufacturer's protocols. Data 
were extracted with BeadStudio software (Illumina), fol- 
lowing quantile normalization across all samples. 

Polysome fractionation 

Polysomes were isolated as previously described (34,35). 
Cells were incubated with 100 |xg/ml cyclohexamide for 10 
min prior to harvesting. Ten million cells were harvested us- 
ing accutase and resuspended in 10 mM Tris-HCl at pH 
7.4, 10 mM NaCl, 15 mM MgCl 2 , 200 |Ag/ml cyclohex- 
imide, 200 |xg/ml heparin and 2000 U/m RNasin. Cells 
were then lysed in 0.5% Triton X-100, 1% Tween-20, 0.5% 
deoxycholate. Ly sates were incubated on ice for 10 min and 
nuclei were removed by centrifugation at 12 000 x g for 10 
min. Equal OD units were loaded onto linear 10-50% su- 
crose gradients (in 10 mM Tris-HCl at pH 7.4, 75 mM KC1, 
1.5 mM MgCl 2 ) and centrifuged at 36 000 rpm for 2 h at 
8°C in an SW41 rotor (Beckman Coulter). A piston gra- 
dient fractionator (BioComp Instruments) was used to col- 
lect twelve 1 ml fractions. Fractions were incubated with 1% 
SDS and 120 u,g of proteinase K (Invitrogen) for 30 min at 
42°C. Fractions 1-5, 6-8 and 9-1 1 were combined as groups 
1, 2 and 3, respectively. Unfractionated cytoplasmic RNA 
and polysomal RNA groups were purified with phenol chlo- 
roform extraction, followed by purification on an RNeasy 
column with on-column DNase digestion. Poly(A) spike -in 
RNAs (Ambion) were added to each group of RNA. 300 ng 
of RNA from each group was used for microarray analysis. 

Polysome microarray analysis 

The hybridization intensities were first background sub- 
tracted. A two-way normalization was then performed. 
First, a scaling factor was obtained from spike -in controls 
to normalize the intensities between each polysomal group. 
Next, normalization was performed among the four repli- 
cates. For total RNA, the hybridization intensities were 
background subtracted and normalized using the cross- 
correlation method (36). 

Excluded volume effect RNA immunoprecipitation (eRIP) 

Cross-linking-free RNA immunoprecipitation was per- 
formed as previously detailed (27), with the addition of a 
cocktail of molecular crowders [2.5 mg/ml Ficoll PM400, 
7.5 mg/ml Ficoll PM70 (both GE Healthcare) and 250 
ng/ml Dextran Sulphate 670k (Fluka)] to the immunopre- 
cipitation step. Briefly, 10 (jlI of antibody [LIN28 (Protein- 
Tech) or Ul (Santa Cruz)] was incubated with BSA-blocked 



Protein G sepharose beads (Sigma) overnight, washed five 
times with NT2 buffer, and resuspended in 850 |xl of NT2 
buffer with molecular buffer cocktail and 200 U of RNase 
Out and 100 U of SUPERase IN (Life Technologies). Hu- 
man ESC cultures that were 50-70% confluent (1-5 x 10 6 
cells) were then washed with phosphate buffered saline (In- 
vitrogen) and lysed with 500 u.1 of polysome lysis buffer. 
Nuclei were removed by centrifugation at 5000 x g for 5 
min. 100 |xl of the cytoplasmic lysate was then added to 
900 |jl1 of antibody-coated bead slurry from above for each 
sample, and rotated at 4°C overnight. The next day, beads 
were washed five times with NT2, and the RNA eluted with 
Trizol reagent (Invitrogen), followed by standard RNA pre- 
cipitation protocols. The RNA was resuspended in 50 \jA of 
RNase -free water and ready for either real-time PCR or mi- 
croarray analysis, where volumes were kept constant across 
all samples. 10 |xl of RNA was used for each sample for 
preparation of cRNA for microarray analysis. 

LIN28 eRIP microarray analysis 

Samples were analyzed in biological triplicate for all four 
experimental conditions; untreated, ligand-induced tro- 
phoblast lineage differentiation protocol (TE) at 12 h, TE 
at 24 h, and ligand-induced neuronal lineage differentiation 
protocol at 24 h. Samples were quantile normalized across 
the full dataset, the quality of the microarray data was as- 
sessed using BeadStudio software (Illumina) and the simi- 
larity of the replicates verified by unsupervised hierarchical 
clustering. A single outlier array was detected and excluded 
from the subsequent analysis. A T-test was used to compare 
the gene expression levels from the LIN28 and Ul control 
eRIP microarrays, and a ^-value cutoff of 0.01 was applied. 
Probes were also filtered by the eRIP fold change. To deter- 
mine an appropriate fold change cutoff the eRIP false pos- 
itive rate was estimated at discrete cutoff levels between 0.5 
and 3 from the number of probes binding to the Ul control 
eRIP. Increasing the cutoff reduced the false positive rate 
until a fold change of 2, corresponding to a false positive 
rate of approximately 1.75%, after which only a small de- 
crease was observed, and so a 2-fold cutoff was used for the 
analysis of the LIN28-eRIP. 

LIN28 eRIP timecourse analysis 

The change in LIN28 binding over the trophoblast lineage 
differentiation timecourse was quantified using a linear fit 
to the LIN28 versus Ul control eRIP fold change across 
the three time -points, using all experimental replicates. A 
scaled gradient was defined as the LIN28 gradient divided 
by the standard error in the gradient. An equivalent neu- 
ronal lineage differentiation gradient was determined but, 
as there are only two neuronal-differentiation time-points, 
no scaled gradient was defined. 

Pathway enrichment analysis and network visualization 

For each of the LIN28 gene lists a />-value of over- 
representation in a suite of canonical pathways (KEGG, 
Wikipathways and Reactome) was determined using the hy- 
pergeometric distribution. A visualization of the physical 
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interactions between the LIN28-bound transcripts in the 
top five enriched pathways was rendered using the Gene- 
mania plugin (37), within Cytoscape software (38). 

Analysis scripts 

The R scripts used to conduct the microarray analysis and 
compile the associated figures in this manuscript have been 
compiled as a Sweave file, available as supplementary infor- 
mation. 

Accession numbers 

LIN28 eRIP and polysome microarray datasets are 
available in the ArrayExpress database (www.ebi.ac.uk/ 
arrayexpress) under accession numbers E-MTAB-1652 and 
E-MTAB-1653, respectively. 

RESULTS 

Characterization of differentiating hESCs 

The initial 24 h of defined directed differentiation of hESCs 
was used to investigate the dynamic interactions of endo- 
geneous LIN28 with its target mRNAs during the initia- 
tion of differentiation. Two distinct established differenti- 
ation protocols were used, rather than random differenti- 
ation via embryoid body formation. Trophectoderm (TE) 
differentiation was induced by the addition of BMP4 and 
FGFR-inhibitor SU5402 (30), and neural (N) differentia- 
tion induced by synthetic retinoid EC23 (31). Within 24 h 
of directed differentiation of HI hESCs, transcript expres- 
sion of early trophectoderm markers FGFR3 and CDX2 in- 
creased dramatically only in cells treated with the TE dif- 
ferentiation protocol. An increase in the expression of neu- 
ral markers PAX6 and SOX9 was detected only for cells 
treated with EC23 (Figure 1A). Both differentiation proto- 
cols induced a 50% decrease in mRNA levels for pluripo- 
tency markers NANOG and SALL4 within 24 h. However, 
no change was detected for OCT4 and LIN28 transcripts 
upon differentiation (Figure IB). The levels of mature let- 
7 miRNAs and miR-lOa remain unchanged in the first 24 
h of differentiation (Figure 1C). Conversely, levels of miR- 
296 increased as has previously been shown upon retinoid 
differentiation (33). These results are consistent with estab- 
lished data for induced differentiation of hESCs and with 
the previous findings that LIN28 expression decreases, and 
mature let-7 expression increases, only after 5-10 days of 
embryoid body differentiation in hESCs (11). 

Establishing quality of LIN28 eRIP and polysome profiling 

We previously employed the concept of EVE to dra- 
matically improve the specificity and sensitivity of real- 
time PCR (29). Here we expand the utility of EVE in a 
non-crosslinking RNA-immunoprecipitation (RIP) proto- 
col (27), which we term eRIP. The eRIP method introduces 
neutral and hydrophilic macromolecular crowders during 
the immunoprecipitation step to more closely emulate the 
intracellular biophysical environment and reduce method- 
induced deviations in molecular interactions, as well as 



maximize the capture efficiency and specificity of antibody- 
RBP-RNA complexes from hESC cytoplasmic extracts. 
Western blot analysis of LIN28 protein in the immuno- 
pricipitate and flow-through from conventional RIP com- 
pared to eRIP indicated that antibody-LIN28 affinity was 
enhanced in eRIP (Figure ID), illustrating the improved ef- 
ficiency of our method. Illumina beadchip microarray anal- 
ysis was performed on HI hESC total RNA and eRIPs 
with beads only and anti-Ul negative controls, and anti- 
LIN28, versus treatment types: untreated cells, neural dif- 
ferentiation for 24 h (N24) and trophectoderm differentia- 
tion for 12 and 24 h (TE12 and TE24). eRIP enrichment 
values for all LIN28-associated genes were generally cor- 
related between the two differentiation protocols at 24 h, 
with an R 2 value of 0.59, suggesting that LIN28-mediated 
post-transcriptional regulation is involved in the initiation 
of both differentiation programs (Figure IE). Unsupervised 
hierarchical cluster analysis of this dataset demonstrated 
that each sample set clustered together distinctly with itself, 
where the transcript representation profile for each treat- 
ment of each sample was similar (Figure IF). Importantly, 
the beads only and Ul negative controls all clustered to- 
gether with a similar pattern indicating coherent and re- 
producible background control. As such, the method was 
deemed fit for use for research purposes, but has yet to be 
qualified to ICH guidelines. It is interesting to note here that 
the neural and trophectoderm eRIP patterns were similar in 
all replicates. Similar relative patterns were also observed in 
all total RNA samples, suggestive that many, but not all, 
mRNA levels do not alter significantly within the first 24 
h of hESC differentiation and that modulation in transla- 
tional efficiency may be a key factor in early differentiation 
responses. 

Transcripts significantly associated with LIN28 were 
identified by differential expression analysis (39), compar- 
ing LIN28 and Ul eRIP arrays (Figure 2A). A total of 
1984 transcripts were significantly associated with endo- 
geneous LIN28 in undifferentiated hESCs (Figure 2B and 
Supplementary Table SI). A recent study demonstrated 
that LIN28 binds selectively to quadruplex-forming G-rich 
RNA sequences (QGRS) (O'Day et al. , submitted for publi- 
cation). To substantiate and endorse this gene list, we com- 
pared the QGRS G-score per kb (40) for the top 50 LIN28- 
associated mRNAs detected herein with the top 50 LIN28- 
associated mRNAs detected by two comparable studies 
(21,41), and the top 50 mRNAs associated with G-quartet- 
binding FMRP as a positive control (42). As negative 
controls, a randomly generated list of 50 mRNAs (RSA- 
tools, http://rsat.ulb.ac.be/random-genes_form.cgi) and the 
top 50 mRNAs associated with the AU-rich binding HuR 
were used (43). A cumulative frequency plot of these data 
confirms that all three of the LIN28-associated datasets are 
significantly enriched for G-rich, quadruplex-forming se- 
quences, with Kolmogorov-Smirnov (K-S) test p-values of 
< 0.05 compared to the random gene list (Figure 2C). While 
it can be challenging to directly compare across different 
studies due to methodological differences, such as IP pro- 
tocol, cross-linking and capture of endogeneous versus ex- 
ogeneous LIN28 in different cell lines, eRIP did identify 
53% of the 276 high-confidence targets in the study, by Peng 
et al. , which captured exogeneous FLAG-tagged LIN28 via 
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Figure 1. Analysis of 24 h trophectoderm (TE24) and neuronal retinoid (N24) differentiation. Quantitative real-time PCR expression analysis of (A) 
differentiation markers, (B) pluripotency markers and (C) miRNAs comparing hESCs induced to differentiate with either TE24 or N24 to undifferentiated 
cells. Trophectoderm markers FGFR3 and CDX2 were only upregulated upon TE24 treatment, while PAX6 and SOX9 were only upregulated upon N24 
treatment. NANOG and SALL4 were downregulated, whereas L1N28 and let-7 levels remained unchanged with both differentiation protocols. (D) Western 
blot of LIN28 protein levels in cytoplasmic lysates, 5-fold-diluted immunoprecipates (IP), and 10-fold-diluted first flow-throughs (FT) for conventional 
RIP compared to enhanced RIP (eRIP), illustrating the increased efficiency of eRIP. (E) Comparison of LIN28-enriched genes for both TE24 and N24 
differentiation treatments, showing a correlative R value of 0.59. (F) Unsupervised hierarchical clustering of Illumina beadchip microarrays showing 
differences between control (beads-only and Ul nuclear RBP) eRIPs, LIN28 eRIPs and total RNA expression profiles. Error bars depict the S.D. of three 
independent experiments. (*p < 0.05). 



non-cross-linking RIP in HI hESCs (21). Forty-five percent 
of eRIP targets were also identified in the Wilbert et al. 
study, which used cross-linking IP (CLIP) to capture en- 
dogenous LIN28 in H9 hESCs (23). Twenty-four percent 
of 1803 LIN28-associated genes observed by Hafner, where 
photoactivatable-ribonucleoside-enhanced CLIP was used 
to capture exogeneous LIN28 in HEK293 cells (41), were 
identified by eRIP. Overall, eRIP data show a higher cor- 
relation with non-cross-linking RIP methodology used to 
identify endogeneous LIN28-associated mRNAs. In ad- 
dition, 50% of the LIN28-associated transcripts identi- 
fied herein are novel and not present in any of the three 
previously published datasets. These data may reflect the 
broader spectrum of potential LIN28 targets whose asso- 
ciation with LIN28 may vary across cell types due to dif- 
ferences in aspects such as dynamic balances of RNAs (in- 
cluding small non-coding RNAs) and other RBPs, as well 
as phosphorylation/acetylation states of the endogeneous 
protein. 



Genome -wide polysome profiling is an established in- 
dicator of translational efficiency, where greater associa- 
tion of mRNA with polysomes is an indicator of increased 
translation efficiency, and vice versa (34,44). This was ap- 
plied to explore the translational regulatory landscape of 
these hESCs and how this related to LIN28-associated tran- 
scripts. Polysomal fractions were separated into four dis- 
tinct groups and the translational efficiency of each tran- 
script was computed as a ratio of the microarray signal 
from heavy to light polysomes (or G3 to G2) (Figure 2D 
and Supplementary Table S2). This can be visualized in a 
Q-Q plot comparing experimental to theoretical T-scores 
(Figure 2E), which is distinctly different from the LIN28- 
association plot in Figure 2A, where we did not detect 
much enrichment in the negative direction (i.e. for Ul- 
association). To substantiate the microarray dataset, quan- 
titative real-time PCR was performed for 12 randomly se- 
lected mRNAs across all of the gradient groups, and both 
datasets were highly correlated (R 2 — 0.83) (Figure 2F). 
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Figure 2. Analysis of LIN28 eRIP and polysome profiling. (A) Q-Q plot comparing the experimental and theoretical T-scores for LIN28 compared to Ul 
eRIPs in untreated hESCs. Data points above and below the blue line represent mRNAs that deviate from the expected distribution, where positive scores 
represent LIN28 compared to Ul RIP enrichment and vice versa. T-scores above 1 (yellow), 2 (orange) and 5.841 (red, equivalent to a p-value of <0.01) 
are highlighted. (B) Volcano plot illustrating the relationship between />value and fold change for LIN28 compared to Ul eRIPs. Of the mRNAs that have 
a />-value of <0.01, 1,984 have a fold enrichment value of >2 (orange box), suggesting that these mRNAs are associated with LIN28 in undifferentiated 
hESCs. (C) Box plot of the total quadruplex-forming G-rich RNA sequences (QGRS) score per kb of the top 50 transcripts associated with various RNA- 
binding proteins. Compared to an RSA-generated random list of 50 genes, FMRP and all LIN28 datasets were enriched for such sequences, but HuR, 
known to target AU-rich regions, was not. ** indicates a K-S test p-value of < 0.01, and * p-value < 0.05, compared to the list of random genes. (D) 
Schematic of representative spectra for the polysome analysis experiments, showing the 12 fractions collected and how they are grouped for microarray 
analysis. (E) Q-Q plot of the translational efficiency ratio (calculated as the ratio of G3:G2) for undifferentiated hESCs. Data points above and below the 
blue line represent mRNAs that deviate from the expected distribution, where positive scores represent highly efficiently translated mRNAs and vice versa. 
(F) Quantitative real-time PCR validation of the polysome microarray data for 12 random genes, resulting in a correlative R 2 value of 0.83. 



LIN28 association and translational efficiency upon hESC 
differentiation 

Considering all three time-points and both differentiation 
protocols, 2995 transcripts were significantly associated 
with LIN28 in at least one time point (Supplementary Table 
SI). To identify and compute changes at the onset of dif- 
ferentiation, we calculated the timecourse gradient of total 
mRNA expression, LIN28 association, and translational 
efficiency for each transcript across the three time-points: 
untreated (or undifferentiated), 12 h TE and 24 h TE treat- 
ment. A positive gradient value indicates an increase in a 
particular observation, such as LIN28 association, upon 24 
h of hESC differentiation into the trophectoderm lineage. 
We defined the scaled gradient as the ratio of the gradient 
to the standard error of the gradient. Transcripts with scaled 
gradients between — 1 and 1 (i.e. that the magnitude of the 
standard error is greater than the gradient) were disregarded 
from this part of the analysis. 



We compared the change in transcript expression dur- 
ing trophoblast differentiation of the LIN28-associated 
transcripts identified in this study, the significant LIN28 
targets from the Peng study (21), known trophectoderm 
and pluripotency genes (45), and all expressed genes (Fig- 
ure 3A). As expected, transcript levels of trophectoderm 
marker genes significantly increased upon TE differentia- 
tion, while transcript levels of pluripotency-related genes 
significantly decreased. The LIN28-associated transcripts 
from both datasets showed little change, demonstrating 
that these genes were not co-regulated at the transcrip- 
tional level. When the change in translational efficiency 
was compared for the same lists of genes, only the LIN28- 
associated transcripts from this study showed a significant 
decrease in translational efficiency upon differentiation (K- 
S test p-value < 0.001) (Figure 3B), supporting our hypoth- 
esis that LIN28-associated transcripts are regulated at the 
post-transcriptional level. The majority of transcripts (750, 
or 56% of LIN28-associated transcripts) decreased in both 



Nucleic Acids Research, 2014, Vol. 42, No. 12 8003 




Figure 3. Gradient analysis of total RNA and translational efficiency upon differentiation. Cumulative distribution plots of (A) scaled total RNA gradient 
and (B) scaled translational efficiency gradient for hESCs undergoing 24 h of trophoblast differentiation. A positive gradient indicates that a particular 
gene is increasing in either mRNA expression (A), or translational efficiency (B). K-S tests identified significant differences between 'all genes' (black) and 
'pluripotency marker' (blue) or 'trophectoderm marker' (green) genes for total RNA in (A), and between 'all genes' (black) and 'Tan LIN28 genes' (orange) 
for translational efficiency in (B), denoted by * p-value < 0.001. (C) Statistics of genes significantly changing in either LIN28 association or translational 
efficiency upon T24 differentiation. A large proportion of genes are significantly decreased in both LIN28 association and translational efficiency. (D) 
Cumulative distribution plot of LIN28-association gradient for hESCs undergoing neural differentiation (N24). All genes associated with LIN28 (black) 
were significantly different from genes that are both decreasing in association with LIN28 and decreasing in translational efficiency upon T24 (red), showing 
that these genes are also decreasing in LIN28 association upon N24 differentiation. * denotes a K-S test p-value of < 0.001. (E) Representative western 
blots comparing non-targeting siRNA control-transfected (siNT) to siLIN28-transfected (siLIN28) hESCs at 72 h. ACTB was used as a loading control, 
while EZR was a non-LIN28-associating control. 



LIN28 association and translational efficiency upon differ- 
entiation (Figure 3C). Analysis of hESCs induced to differ- 
entiate with the neural protocol showed a similar pattern of 
response to trophoblast differentiation (Figure 3D). Of all 
the LIN28 mRNA cargoes identified in this study, 66% shift 
in a similar fashion upon induction of differentiation with 
either protocol (Supplementary Table SI), including novel 
stem cell-related targets such as L1TD1, PKM2, ID1, PIN1 
and CSE1L. Western blot analysis of siRNA-transfected 
hESCs showed that perturbation of LIN28 levels lead to 
decreases in protein levels of PKM2, KPNA2, PNP and 
DNMT3B, but not ACTB and non-LIN28-associating con- 
trol EZR (Figure 3E). This pattern in protein level changes 
was also observed during differentiation (data not shown), 
where LIN28 and polysome associations with these mR- 
NAs were shown to be decreased. These data suggest that 
the modulation in translational efficiency due to changes in 
association of LIN28 with these transcripts are specific to 



the loss of pluripotency, rather than the method of differen- 
tiation induction. 

A gene ontology and pathway analysis of the subset of 
transcripts that decreased in both LIN28 association and 
translational efficiency upon trophoblast differentiation re- 
vealed significant enrichment in the Wnt signaling, cell cy- 
cle, RNA metabolism and proteasomal pathways (Figure 
4A). By comparison, targets that increased in LIN28 asso- 
ciation but decreased in translational efficiency upon differ- 
entiation were enriched for NGF, TNF-a/NF-Kp and IL- 
2 signaling pathways (Figure 4B). This implies that LIN28 
binding leads to the translational enhancement and sup- 
pression of different subsets of transcripts, and corrob- 
orates a recent study in mice describing LIN28 as both 
a suppressor of ER-associated translation and a transla- 
tional enhancer for a different set of mRNAs (25). A func- 
tional network constructed from the LIN28-associated, 
pathway-enriched genes in Figure 4A reveals a core set of 



8004 Nucleic Acids Research, 2014, Vol. 42, No. 12 



Pathway 


P - V a.ue 


Number of 
Genes 


Percentage in 
Pathway 


Proteasome 
(KEGG) 


1.7 E-9 


12 


26.7% 


Signaling by Wnt 
(Reactome) 


2.0 E-12 


17 


26.6% 


Cell Cycle Checkpoints 
(Reactome) 


6.2 E-12 


22 


18.8% 


Metabolism of RNA 
(Reactome) 


1.3 E-9 


23 


14.0% 


Cell Cycle Mitotic 
(Reactome) 


1.5 E-10 


35 


11.1% 



Pathway 




Number of 
Genes 


Percentage in 
Pathway 


IL-2 Signaling Pathway 
(Wikipathways) 


4.1 E-5 


8 


10.4% 


TNF-a/NF-kp' Signaling 
Pathway (Wikipathways) 


1.5 E-5 


14 


7.4% 


Signaling by NGF (Reactome) 


6.2 E-6 


16 


7.3% 


RNA transport (KEGG) 


1.2 E-4 


11 


7.2% 


Pathways in cancer (KEGG) 


9.6 E-5 


18 


5.5% 



Figure 4. Gene ontology analysis of LIN28-association and translational efficiency upon differentiation. (A) The top five pathways for the genes that 
significantly decreased in both LIN28 association and translational efficiency upon differentiation. (B) The top five pathways for genes that are increasing 
in LIN28-association gradient and decreasing in translational efficiency gradient. 



proteasome-related genes that are common for all pathways 
for trophectoderm differentiation (Figure 5). Importantly, 
of the 69 genes in this network that includes 15 proteaso- 
mal genes, 60 (including 14 proteasomal genes) were also 
decreased in LIN28-association upon neural differentiation 
(Supplementary Table SI). The identification of these early- 
response translationally regulated pathways suggests that 
LIN28 plays a far larger role during the initiation of differ- 
entiation, as well as control of pluripotency, than previously 
appreciated. 

DISCUSSION 

We have developed the sensitive eRIP method to track 
the dynamic changes of transcripts associated with endo- 
genous LIN28, and demonstrated that 95% of these also 
decrease in translational efficiency upon the onset of dif- 
ferentiation in hESCs. We show that the majority of tran- 
scripts that decrease in association with LIN28 also de- 
crease in translational efficiency, and provide evidence that 
LIN28 controls crucial cell cycle and RNA metabolism 
pathways via post-transcriptional regulation of a core set 
of proteasome genes. Importantly, these events are cap- 
tured prior to any changes in mature let-7 miRNA levels, 
supporting the observation in C. elegans development that 
fef-7-independent regulation by LIN28 occurs before let-7 - 
dependent steps (19). Of the 2995 transcripts that are as- 
sociated with LIN28 in this study, only a small number 



changed significantly in transcript expression upon differ- 
entiation, 26 with TE24 treatment and 7 with N24 treat- 
ment. This suggests that most LIN28-associated mRNAs 
are post-transcriptionally regulated, at least during the first 
24 h of differentiation. The reproducible pattern of LIN28- 
associated mRNAs in undifferentiated hESCs and the rapid 
changes in LIN28 mRNA cargoes upon the onset of dif- 
ferentiation offers the potential for development of useful 
eRIP-based in-process or release specifications assay to test 
the functional quality of pluripotent stem cells destined for 
therapeutic use. It is envisaged that this technology could 
be linked to a real-time PCR detection step where a narrow 
gene set would be used to assess pluripotency before, or even 
after, differentiation. 

Importantly, our eRIP method identified novel LIN28- 
associated transcripts. Those identified include L1TD1, a 
LIN28-interacting RBP required for hESC self-renewal 
(46), PKM2, which interacts with OCT4 and is enriched 
in hESCs, facilitating self-renewal and proliferation (47), as 
well as ID1 and PIN1, critical factors for self-renewal and 
maintenance of pluripotent stem cells (48,49). All four of 
these genes decrease in LIN28 association upon both differ- 
entiation treatments and translational efficiency upon TE 
differentiation. Interestingly, a novel target, CSE1L, that is 
essential for early embryonic growth and development (50), 
increases its association with LIN28 during both differen- 
tiation treatments, but decreases in translational efficiency 
upon TE-induced differentiation. 
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Figure 5. Network of genes that decrease in both LIN28-association and translational efficiency upon differentiation. Functional interaction network of 
genes in the top five pathways in Figure 4A. Each gene is represented by two semi-circles: the one on the left represents the LIN28-association gradient, and 
the one on the right represents the translational efficiency gradient, where white is 0 and black is — 1. Genes in bold indicate a negative LIN28-association 
gradient upon hESC differentiation with both trophoblast and neural differentiation protocols. 



The majority of the transcripts decreasing in both LIN28 
association and translational efficiency include those in 
known LIN28-regulated pathways such as cell cycle and 
RNA metabolism (9,21). Downregulation of LIN28 has 
been shown to be important in hESC differentiation (51). 
Given the essential role of the Wnt signaling pathway in 
the maintenance of hESC pluripotency (52), it is intrigu- 
ing that our data suggest that LIN28 regulates the initiation 
of differentiation through post-transcriptional downregu- 
lation of Wnt pathway genes. Of significance is the iden- 
tification of the proteasome pathway as a central node in 
our analysis. Studies have shown that the proteasome is im- 
portant for the maintenance of pluripotency; it is also com- 
monly overexpressed in human mature oocytes and hESCs, 
and downregulated upon differentiation (53,54). 

Recent work shows that LIN28 also functions as a trans- 
lational repressor for ER-associated genes (25). This phe- 
nomenon is reflected in our dataset, where the genes of three 
signaling pathways are increasingly associated with LIN28 
but decrease in translational efficiency upon hESC differen- 
tiation. Interestingly, decreased expression of NF-k|3 path- 



way genes has been detected in differentiating hESCs (55), 
while IL-2 and NGF have been used to induce differenti- 
ation of hESCs to natural killer cells and neurons, respec- 
tively (56,57), lineages that are distinctly different from the 
trophectoderm. As the ER orchestrates the production of 
membrane -bound and secreted proteins, which are encoded 
by many genes involved in signaling pathways, it is pos- 
sible that LIN28 localization to the ER alters its role in 
post-transcriptional regulation such that increased associ- 
ation with LIN28 induces decreased translation. As LIN28 
levels do not change upon the initiation of differentiation, 
it is likely that distinct partner RBPs and/or non-coding 
RNAs (ncRNAs) associate with LIN28-bound transcripts, 
as exemplified by 5C7-FMRP (58). Similar to the the- 
ory of eukaryotic post-transcriptional operons (59), these 
LIN28-associated mRNPs can recruit different factors to 
enhance translational efficiency, for example RNA helicase 
A as shown previously (20). Alternatively, RBPs such as 
CUGBP2, which has been shown to stabilize COX2 mRNA 
while suppressing its translation (60), could be recruited 
to LIN28-associated mRNPs. This hypothesis fits observa- 
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tions that LIN28 can stabilize mRNAs (41), as well as in- 
duce translational inhibition (25). It will therefore be inter- 
esting to perform LIN28 eRIP, coupled with mass spectro- 
metric analysis, to identify the protein-interacting partners 
in these mRNPs. 

We have demonstrated that LIN28 is a key translational 
determinant of the initiation of hESC differentiation. Al- 
though most of the LIN28-associated transcripts do not 
change in abundance upon differentiation, almost all de- 
crease in translational efficiency, whereas only close to two- 
thirds decrease in LIN28-association, as revealed by eRIP 
and polysome analyses. Pathways important for this pro- 
cess include Wnt signaling, cell cycle and RNA metabolism, 
which are connected by a central proteasome core. Our data 
thus suggest that in the pluripotent state, LIN28 acts to sus- 
tain the translational efficiency of its target genes through 
contrasting mechanisms, possibly by co-association with 
additional RBPs or ncRNAs. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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